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ABSTRACT 

We present a Software Keyboard for smart touchscreen de¬ 
vices that learns its owner’s unique dictionary in order to 
produce personalized typing predictions. The learning pro¬ 
cess is accelerated by analysing user’s past typed communi¬ 
cation. Moreover, personal temporal user behaviour is cap¬ 
tured and exploited in the prediction engine. Computational 
and storage issues are addressed by dynamically forgetting 
words that the user no longer types. A prototype implemen¬ 
tation is available at Google Play Store. 

Categories and Subject Descriptors 

H. 2.8 [Database Applications]: Data mining; C.5.3 
[Computer System Implementation]: Personal Com¬ 
puters 

General Terms 

Algorithms 

Keywords 

software keyboards, text predictions 

I. INTRODUCTION 

Over the last few years smartphones and tablets, have 
achieved mass adoption. Currently, smart devices possess 
impressive computational and sensing capabilities. However, 
little improvement has been achieved with respect to their 
(software) keyboards (SKs). 

Contemporary SKs are more difficult to use, due to lack 
of typing feedback, large number of keys and small screen 
size. On the other hand, the need for written communication 
is increasing. Popular everyday applications include chat 
messaging, e-mail exchanging, micro-blogging, participation 
in social networks and even editing documents. 

With the rise of multitouch capacitive touch-screens, var¬ 
ious research approaches |13[ and corporate efforts 
tried to improve software keyboards. However, none of them 
managed to be successfully adopted yet. Existing commer¬ 
cial SKs, are based on generic dictionaries adapting to 
personal typing behavior by learning new words and phrases 
that the user commonly uses. Generic predefined dictio¬ 
naries maintain a large amount of unnecessary information 
and fail to capture the user language. For example, it is 
common for non-English speakers to type their messages in 
their mother tongue using Latin characters. These dialects 
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use words that can not be found in common dictionaries, 
since they differ from user to user and there is no common 
grammar. As a consequence, ‘auto-complete’ solutions have 
disappointed users even leading them into creating memes 
discussing these issues pj] . 

We introduce Anima, an SK that captures its owner’s 
unique dictionary in order to produce personalized typing 
predictions. Anima is characterized by the following novel 
features: 

1. There is no requirement for a predefined dictionary 

2. The learning process is accelerated by considering the 
user’s past written communication 

3. A low memory footprint is achieved, by forgetting 
words that the user does not use any more, and finally 

4. Anima learns user’s daily patterns and produces time- 
aware predictions 

2. SYSTEM OVERVIEW 

Architecture. According to Human Computer Interaction 
(HCI) research, word suggestion engines (‘auto-complete’) 
decrease the user’s typing speed without reducing error rates 
[^. Hence, we focus on predicting the next eharaeter that 
the user will most probably type. The architecture of Anima 
decouples the User Interface (UI) from the Prediction Engine 
(PE). In this way, we provide the opportunity to the HCI 
research community to develop new UI approaches that will 
utilize the existing PE. Hence, we defined a simple contrac 
that the UI must follow: 

1. Inform PE about the typed character 

2. Request predictions for the next character 

3. Visually exploit the predicted character set that PE 
returns 

4. Send feedback to PE when a prediction is not accurate 

The system overview of Anima is depicted in Figure 

User Interface. Anima’s UI, is inspired by recent HCI re¬ 
search shrinking the keyboard buttons that are not in¬ 
cluded in the PE’s prediction set. As a result, it visually 
aids the user to type the desired characters (see Figure [^. 
In addition, shrinking buttons result to larger empty spaces 
between keys preventing mistypes. In order send feedback 
for a bad prediction the user can swipe diagonally. Finally, 
the user can hide the keyboard by swiping down. 
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Figure 2: Anima UI interaction while user is typing the word Dog 



Figure 1: Anima Twitter Accelerator 


3. PREDICTION ENGINE 

The Prediction Engine is the module that predicts the 
next character that the user will type. Its main structure 
is based on a Trie data structure [^, with weights in each 
node. Let’s assume a stream of user key strokes at each 
time-step t: {ai,..., at,...}, where at ^ K and K is the 
set of all possible keys in a keyboard. A corresponding 
stream of prediction-sets is produced by the prediction en- 
gine where Gt = {g(t,i), ■ ■ ■, g(t,nt)}^ with 

g E K and n the size of the prediction set. Note that t in 
rit implies that the size of the prediction set is varying (see 
next paragraphs). 

Learning and Predicting. Each time the user types 
a character at, PE traverses the Trie from the root, by de¬ 
scending one level every time a character is typed. The 
weights in each node, represent how many times the user 
has traversed this path. The weighted Trie (WT) tracks the 
traversed paths and adds words that the user types for the 
hrst time. A cursor (c) points to the current level of the Trie. 
Every time the user types a word separator, c is redirected 
to the root of WT. In order to produce predictions for the 
next character Gt+i, given the sequence of characters that 
have been typed after the last word separator (word IT), 
Anima computes the conditional probabilities P{k \ W) for 
every k E K, based on the weights of the nodes at c. Then, 
the top-nt characters according to P are returned. The time 
complexity of computing all probabilities of the next char¬ 


acter is constant, 0(1). This results from the following: 

• The limited number of probability calculations (i.e. 
size of iL) and 

• The fact that these computations consider information 
that is available within the node of ITT (instant ac¬ 
cess) 

Constant time complexity is crucial for such applications, 
since any propagated delay to the UI level, would annoy the 
user and hurt the usability of the keyboard. 

Confidence and Diffidence. An upper bound rit is de- 
hned, representing the maximum number of predicted char¬ 
acters at time t. This upper bound is dynamic and is re- 
hned automatically and continuously. It is associated with 
the conhdence of the PE. The time that rit is adjusted, is 
dehned by two parameters, conf (conhdence) and dif f (dif- 
hdence) associated with bad prediction feedback from the 
UI. After conf consecutive successful predictions, rit is de¬ 
creased by one, resulting to a more aggressive (conhdent) 
behaviour shrinking k — rit keys. On the other hand, when 
PE gets diff continuous negative feedbacks, rit is increased 
by one, in order to shrink a smaller set of characters. 
Learning Accelerators. In order to accelerate the learn¬ 
ing process. Accelerators can be added to the PE. Accelera¬ 
tors are messages written by the user in the past. Currently 
one Accelerator has been implemented, shown in Eigure[^ 
which crawls the user’s Twitter account and feeds his/her 
messages to the engine, in order to train it. 

Pruning. Contemporary SKs come with a pre-installed dic¬ 
tionary and learn words that the user is typing and are not 
included in the dictionary. Google’s Android SK contains 
a dictionary of 160,722 words for the English UK language 
and its footprint continues to increase as the user types 
new words (see Eigure|^). In order to address the increas¬ 
ing volume of the dictionary, Anima prunes ITT, based on 
a Least Recently Used algorithm [^. As a result, it main¬ 
tains a dictionary that is more suitable to the user, since it 
consists of more relevant recently typed words. 
Time-awareness: Anima incorporates Time-Awareness 

(TA) in the PE, in order to incorporate user daily habits 
into the predictions. This is done by partitioning a day into 
T partitions, and by keeping a ITT per partition, repre¬ 
sented as ITTt, t G [0,1,...,T]. During a partition t, the 
respective ITTt is active. Our experimental results (Section 
, showed that the invocation of the time feature improved 
the accuracy of Anima. 
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Multilingual Support: PE is language agnostic, which 
means that in order to add a new language only the UI 
level has to be updated with the new character layout. This 
is a result of the way the PE handles the characters and 
the predictions, since WTs are independent of languages. 
Einally, in the case of a user typing in multiple languages, for 
each supported language there has to be a different instance 
of the PE. 
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Figure 3: Anima overview 


4. EVALUATION 

Dataset. We utilized a dataset with messages typed 
through mobile devices. More specifically we collected Twit¬ 
ter messages and kept only the ones containing geo-location 
information. This information is only available when a user 
posts through GPS-enabled devices (smart phones, tablets, 
etc). We crawled Twitter using the Streaming API for 
the period between February 16, 2014 and April 6, 2014- 
Tweets were collected using a bounding box around the 
United Kingdom, resulting in a dataset with over 27 mil¬ 
lion tweets. We selected the top 100 authors in terms of 
tweet count. In addition, we developed a simulator of the 
Anima engine (source code available here [^), in order to sim¬ 
ulate the users’ typing behavior using Anima. Retweets and 
hyperlinks were ignored, because they do not contain words 
and characters actually typed by the users. Instead, men¬ 
tions to other users were taken into account, because they 
can be considered as nicknames that the user uses to refer 
to a friend. Einally, we manually filtered-out accounts that 
are automated systems (e.g. weather broadcasting accounts, 
bots, or news broadcasting accounts). Messages were sorted 
in a chronological order. We simulated users typing through 
Anima, by using multiple-sized training sets containing 0 to 
1500 tweets, with a step of 50 tweets. The test sets consist 
of the following 500 messages. 

Results: We evaluated the predictive accuracy of Anima, by 
computing the hit ratio over how many predicted char¬ 
acters were returned in each prediction. The ideal predic¬ 
tion engine would always predict exactly one character which 
would be the correct one, resulting in a precision of 1. 

Eor the experiments, we used the two following variations 
of Anima: 


Algorithm 1 Time-Aware typing predictions 

Input: 

ch: Key typed. 

feedback: Bad Prediction Eeedback. 

Return: 

P\ List of predicted next keys along with their respective 
probability. 

1: time ^ clock.Now 
2: if feedback then 
3: idle ^ true 

4: dif fidenceQ 

5: else 

6: confidencei) 

7: end if 
8: if idle then 

9: cursor ^ TimeTries[time\ 

10: cursor .Add{ch) 

11: cursor ^ cursor[ch] 

12: if isWordSeparator(c/i) then 

13: idle ^ false 

14: TimeTries[time] ^ TimeTries[time].Root 

15: return 0 

16: end if 

17: end if 

18: cursor ^ TimeTries[time] 

19: cursor ^ cursor[ch] 

20: for each ch G cursor do 
21: P ^ PU popularity(c/i) 

22: end for 

23: if isWordSeparator(c/i) then 

24: TimeTries[time\ ^ TimeTries[time].Root 

25: end if 

26: return normalize(P) 


• Anima with Google’s dictionary for the UK English lan¬ 
guage consisting of 160, 722 words. Accelerator- 
disabled, TA-disabled and Pruning-disabled. 

• Anima with no initial dictionary. Accelerators dis¬ 
abled, TA-enabled and Pruning-enabled. 

The dictionary approach, performed worse than Anima, 
which had no initial dictionary, even when using small train¬ 
ing sets. Results are depicted in Eigure [^. The TA ap¬ 
proaches performed better than the simple ones and as the 
number of the time partitions increased, the precision also 
increased. The results of Anima with increasing number of 
time partitions can be found in Table ^ Anima without 
Pruning performed worse than the version with Pruning, 
but the losses were covered from the gain of the TA. 

Moreover, we studied the size of the words learned when 
Pruning was disabled and we observed that the number of 
words was increasing with the training set size. Eigure 
depicts a comparison of the different methods with respect 
to the number of used words as a function of the training 
set size. Even for the Google’s dictionary of 160,722 words, 
Anima continued learning new words as the training set in¬ 
creased, as depicted in Eigure]^ 

5. CONCLUSION AND FUTURE WORK 

In this paper we introduce Anima, an adaptive software 
keyboard for touchscreen devices that learns the user’s dic¬ 
tionary and habits, in order to produce more accurate char¬ 
acter predictions. During the demonstration, the users will 
be able to test Anima with a variety of touchscreen devices 
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• TA and Pruning, no initial dictionary 
— Android dictionary, no TA and Pruning 


(a) Precission 



° TA, no initial dictionary and Pruning 
• TA and Pruning, no initial dictionary 
— Android dictionary, no TA and Pruning 
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Figure 4: Results 
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Table 1: Mean Precision for Anima variations, with¬ 
out any Accelerator and no initial dictionary 

and settings by providing their Twitter username, so that 
Anima can accelerate learning of their personal dictionary. 
We plan to incorporate location features and knowledge 
exchanging between users that have similar behavior, e.g. 
users that are co-workers or are connected through a real-life 
relationship. Finally, we intend to use more Acceleration 
sources, like chat messages, e-mails and social networking 
services. 

An early prototype of Anima, without the Twitter Accel¬ 
erator and the TA feature, is publicly available at Google’s 
Play Store [^, with more than 9000 installations, a rating 
of 4 out of 5, among 72 users and 196 Google +ls. Source 
code is available at [^. 


2. Anima without Time-Aware or Acceleration and with 
Google’s Android dictionary as initial dictionary 

and compare them by using them. 
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6. DEMONSTRATION 

Attendants will be able to use two software keyboards: 

1. Time-Aware Anima, after being Accelerated by their 
Twitter account and 
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